43 research outputs found
Detecting Tidal Features using Self-Supervised Representation Learning
Low surface brightness substructures around galaxies, known as tidal
features, are a valuable tool in the detection of past or ongoing galaxy
mergers. Their properties can answer questions about the progenitor galaxies
involved in the interactions. This paper presents promising results from a
self-supervised machine learning model, trained on data from the Ultradeep
layer of the Hyper Suprime-Cam Subaru Strategic Program optical imaging survey,
designed to automate the detection of tidal features. We find that
self-supervised models are capable of detecting tidal features and that our
model outperforms previous automated tidal feature detection methods, including
a fully supervised model. The previous state of the art method achieved 76%
completeness for 22% contamination, while our model achieves considerably
higher (96%) completeness for the same level of contamination.Comment: Accepted at the ICML 2023 Workshop on Machine Learning for
Astrophysic
Detecting Galaxy Tidal Features Using Self-Supervised Representation Learning
Low surface brightness substructures around galaxies, known as tidal
features, are a valuable tool in the detection of past or ongoing galaxy
mergers, and their properties can answer questions about the progenitor
galaxies involved in the interactions. The assembly of current tidal feature
samples is primarily achieved using visual classification, making it difficult
to construct large samples and draw accurate and statistically robust
conclusions about the galaxy evolution process. With upcoming large optical
imaging surveys such as the Vera C. Rubin Observatory Legacy Survey of Space
and Time (LSST), predicted to observe billions of galaxies, it is imperative
that we refine our methods of detecting and classifying samples of merging
galaxies. This paper presents promising results from a self-supervised machine
learning model, trained on data from the Ultradeep layer of the Hyper
Suprime-Cam Subaru Strategic Program optical imaging survey, designed to
automate the detection of tidal features. We find that self-supervised models
are capable of detecting tidal features, and that our model outperforms
previous automated tidal feature detection methods, including a fully
supervised model. An earlier method achieved 76% completeness for 22%
contamination, while our model achieves considerably higher (96%) completeness
for the same level of contamination. We emphasise a number of advantages of
self-supervised models over fully supervised models including maintaining
excellent performance when using only 50 labelled examples for training, and
the ability to perform similarity searches using a single example of a galaxy
with tidal features.Comment: 11 pages, submitted to MNRAS. arXiv admin note: text overlap with
arXiv:2307.0496
Modeling halo and central galaxy orientations on the SO(3) manifold with score-based generative models
Upcoming cosmological weak lensing surveys are expected to constrain
cosmological parameters with unprecedented precision. In preparation for these
surveys, large simulations with realistic galaxy populations are required to
test and validate analysis pipelines. However, these simulations are
computationally very costly -- and at the volumes and resolutions demanded by
upcoming cosmological surveys, they are computationally infeasible. Here, we
propose a Deep Generative Modeling approach to address the specific problem of
emulating realistic 3D galaxy orientations in synthetic catalogs. For this
purpose, we develop a novel Score-Based Diffusion Model specifically for the
SO(3) manifold. The model accurately learns and reproduces correlated
orientations of galaxies and dark matter halos that are statistically
consistent with those of a reference high-resolution hydrodynamical simulation.Comment: Accepted as extended abstract at Machine Learning and the Physical
Sciences workshop, NeurIPS 202
Differentiable Stochastic Halo Occupation Distribution
In this work, we demonstrate how differentiable stochastic sampling
techniques developed in the context of deep Reinforcement Learning can be used
to perform efficient parameter inference over stochastic, simulation-based,
forward models. As a particular example, we focus on the problem of estimating
parameters of Halo Occupancy Distribution (HOD) models which are used to
connect galaxies with their dark matter halos. Using a combination of
continuous relaxation and gradient parameterization techniques, we can obtain
well-defined gradients with respect to HOD parameters through discrete galaxy
catalogs realizations. Having access to these gradients allows us to leverage
efficient sampling schemes, such as Hamiltonian Monte-Carlo, and greatly speed
up parameter inference. We demonstrate our technique on a mock galaxy catalog
generated from the Bolshoi simulation using the Zheng et al. 2007 HOD model and
find near identical posteriors as standard Markov Chain Monte Carlo techniques
with an increase of ~8x in convergence efficiency. Our differentiable HOD model
also has broad applications in full forward model approaches to cosmic
structure and cosmological analysis.Comment: 10 pages, 6 figures, comments welcom
CMU DeepLens: Deep Learning For Automatic Image-based Galaxy-Galaxy Strong Lens Finding
Galaxy-scale strong gravitational lensing is not only a valuable probe of the
dark matter distribution of massive galaxies, but can also provide valuable
cosmological constraints, either by studying the population of strong lenses or
by measuring time delays in lensed quasars. Due to the rarity of galaxy-scale
strongly lensed systems, fast and reliable automated lens finding methods will
be essential in the era of large surveys such as LSST, Euclid, and WFIRST. To
tackle this challenge, we introduce CMU DeepLens, a new fully automated
galaxy-galaxy lens finding method based on Deep Learning. This supervised
machine learning approach does not require any tuning after the training step
which only requires realistic image simulations of strongly lensed systems. We
train and validate our model on a set of 20,000 LSST-like mock observations
including a range of lensed systems of various sizes and signal-to-noise ratios
(S/N). We find on our simulated data set that for a rejection rate of
non-lenses of 99%, a completeness of 90% can be achieved for lenses with
Einstein radii larger than 1.4" and S/N larger than 20 on individual -band
LSST exposures. Finally, we emphasize the importance of realistically complex
simulations for training such machine learning methods by demonstrating that
the performance of models of significantly different complexities cannot be
distinguished on simpler simulations. We make our code publicly available at
https://github.com/McWilliamsCenter/CMUDeepLens .Comment: 12 pages, 9 figures, submitted to MNRA
Galaxies on graph neural networks: towards robust synthetic galaxy catalogs with deep generative models
The future astronomical imaging surveys are set to provide precise
constraints on cosmological parameters, such as dark energy. However,
production of synthetic data for these surveys, to test and validate analysis
methods, suffers from a very high computational cost. In particular, generating
mock galaxy catalogs at sufficiently large volume and high resolution will soon
become computationally unreachable. In this paper, we address this problem with
a Deep Generative Model to create robust mock galaxy catalogs that may be used
to test and develop the analysis pipelines of future weak lensing surveys. We
build our model on a custom built Graph Convolutional Networks, by placing each
galaxy on a graph node and then connecting the graphs within each
gravitationally bound system. We train our model on a cosmological simulation
with realistic galaxy populations to capture the 2D and 3D orientations of
galaxies. The samples from the model exhibit comparable statistical properties
to those in the simulations. To the best of our knowledge, this is the first
instance of a generative model on graphs in an astrophysical/cosmological
context.Comment: Accepted as extended abstract at ICML 2022 Workshop on Machine
Learning for Astrophysics. Condensed version of arXiv:2204.0707